Efficient Management of Complex Striped Files in Active Storage
Authors
Abstract
Active Storage provides an opportunity for reducing the bandwidth requirements between the storage and compute elements of current supercomputing systems, and leveraging the processing power of the storage nodes used by some modern file systems. To achieve both objectives, Active Storage allows certain processing tasks to be performed directly on the storage nodes, near the data they manage. However, Active Storage must also support key requirements of scientific applications. In particular, Active Storage must be able to support striped files and files with complex formats (e.g., netCDF). In this paper, we describe how these important requirements can be addressed. The experimental results on a Lustre file system not only show that our proposal can reduce the network traffic to near zero and scale the performance with the number of storage nodes, but also that it provides an efficient treatment of striped files and can manage files with complex data structures.
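To illustrate why striped files complicate processing on the storage nodes, the sketch below maps byte ranges of a file onto storage targets under a simple round-robin (RAID-0 style) layout. It is a minimal illustration, not the method proposed in the paper: the function names `locate` and `split_range`, and the parameters `stripe_size` and `stripe_count`, are hypothetical and chosen only for this example.

```python
from typing import List, Tuple


def locate(offset: int, stripe_size: int, stripe_count: int) -> Tuple[int, int]:
    """Map a file byte offset to (target index, offset inside that target's object).

    Assumes a plain round-robin layout: stripe 0 goes to target 0,
    stripe 1 to target 1, and so on, wrapping after stripe_count targets.
    """
    stripe_index = offset // stripe_size
    target = stripe_index % stripe_count
    # Each full round of stripe_count stripes adds one stripe to every target.
    local_offset = (stripe_index // stripe_count) * stripe_size + offset % stripe_size
    return target, local_offset


def split_range(offset: int, length: int, stripe_size: int,
                stripe_count: int) -> List[Tuple[int, int, int]]:
    """Split a byte range into (target, local_offset, nbytes) pieces, one per stripe touched."""
    pieces = []
    end = offset + length
    while offset < end:
        target, local_offset = locate(offset, stripe_size, stripe_count)
        # Stop at the end of the range or at the stripe boundary, whichever comes first.
        nbytes = min(end - offset, stripe_size - offset % stripe_size)
        pieces.append((target, local_offset, nbytes))
        offset += nbytes
    return pieces


if __name__ == "__main__":
    # A 4-byte record starting at offset 2, with 4-byte stripes over 3 targets,
    # straddles two targets, so neither target holds the whole record locally.
    print(split_range(2, 4, stripe_size=4, stripe_count=3))
```

Under these assumptions, a record that crosses a stripe boundary is split between two storage nodes, which is one reason striped files need dedicated handling when computation runs next to the data.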
Similar resources
Extending DIRAC File Management with Erasure-Coding for efficient storage
The state of the art in Grid style data management is to achieve increased resilience of data via multiple complete replicas of data files across multiple storage endpoints. While this is effective, it is not the most space-efficient approach to resilience, especially when the reliability of individual storage endpoints is sufficiently high that only a few will be inactive at any point in time....
Evaluation of Object Placement Techniques in a Policy-Managed Storage System
Storage management cost is a significant fraction of the total cost of ownership of large, enterprise storage systems. Consequently, software automation of common storage management tasks so as to reduce the total cost of ownership is an active area of research. In this paper, we consider a policy-managed storage system—a system that automates various management tasks—and focus on the problem o...
Effective Delivery of Virtual Class on Parallel Media Stream Server
Virtual Class delivers most of its content through multimedia learning objects. To support such multimedia learning objects, its multimedia data service system should have a capacity to serve the growing number of clients and new data. A streaming server transfers multimedia files to clients from a repository of files in real time. The server must guarantee concurrent and uninterrupted delivery...
Fast and Efficient Log File Compression
Contemporary information systems are replete with log files, created in multiple places (e.g., network servers, database management systems, user monitoring applications, system services and utilities) for multiple purposes (e.g., maintenance, security issues, traffic analysis, legal requirements, software debugging, customer management, user interface usability studies). Log files in complex s...
Achieving Efficient File Compression with Linear Cellular Automata Pattern Classifier
Files are created for traffic analysis, maintenance, software debugging, and customer management in multiple places, such as system services, user monitoring applications, network servers, and database management systems, and must be kept for long periods of time. These files may grow to huge sizes in such complex systems and environments. For storage and convenience, files must be compressed. Most of the ...